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Introduction 

This  project  has  been  devoted  to  the  study  and  development 
of  the  use  of  partial  reduction  techniques  In  faotor  analysis. 
These  techniques  are  derived  from  the  area  of  multivariate 
correlational  analysis;  their  application  to  factor  analysis 
yields  oertaln  procedures  whloh  have  definite  advantages  over 
previous  conventional  methods.  Partial  reduction  techniques 
are  oloaely  related  to  the  diagonal  method  of  factor  analysis, 
a  superior  faotor  procedure  beoauae  of  Its  great  logloal 
simplicity. 

Original  objectives  of  this  phase  of  the  project  included 
the  following: 

1.  Preolse  speolf lost Ion  of  the  oharaoterlstlos  of  a 
correlation  matrix  and  of  the  variables  constituting  the  matrix 
for  whloh  the  ooomunalltlea  are:  a)  determinable;  and 
b)  unique. 


-1- 


2.  Definition  of  the  minimum  rank  of  a  matrix  in  terma  of 
the  variables  within  the  matrix  and  the  structure  of  the  under¬ 
lying  faotor  matrix. 

3.  Analysis  of  the  relationship  between  minimum  rank  and 
the  uniqueness  and  determinablllty  of  the  communalltles. 

4.  Delineation  of  the  relationship  between  the  minimum 
rank  of  a  correlation  matrix  and  the  success  with  which  partial 
reduction  techniques  can  be  applied  to  it. 

5.  Definition  of  the  relationship  between  the  uniqueness 
and  the  determinablllty  of  the  communalltles  of  a  correlation 
matrix  and  the  suoceas  with  which  partial  reduction  methods  can 
be  applied  to  it. 

6.  Determination  of  the  sampling  distributions  of  the 
statistics  whloh  were  previously  developed  and  specification  of 
any  required  standard  error  formulae. 

Research  results  have  led  to  a  concentration  of  effort  on  the 
first  four  of  these  objectives ,  since  a  satisfactory  solution  of 
the  questions  Involved  in  these  is  requisite  to  any  further  analysis. 
Accordingly,  attention  has  been  focused  upon  two  current  and  im¬ 
portant  toplos  In  factor  analysis,  both  relating  to  the  problem  of 
estimating  oommunallties.  First  is  an  attempt  to  speolfy  precisely 
the  characteristics  of  a  correlation  matrix  and  of  the  variables 
constituting  the  matrix  whloh  admit  of  communalltles  that  are 
determinable  and/or  unique.  Second  is  the  development  of  a 
refined  emplrloal  technique  for  the  estimation  of  communalltles. 


Research  Reaults 

The  oonoept  of  communallty  was  first  Introduced  by  Thurstons 
in  1935*  The  logical  elaboration  of  his  rationale  by  several 
ourrent  faotor  theorists  has  led  to  the  following  definition  and 
quantitative  conclusions: 

1.  The  communallty  may  be  defined  as  the  squared  multiple 
correlation  of  the  given  observed  variables  on  the  common  faotors. 

2.  The  squared  multiple  correlation  of  the  given  variable 
on  the  remaining  variables  must  be  the  lower  bound  to  the  com¬ 
munallty  if  the  Thurstons  model  is  appropriate  for  the  given 
matrix  (Outtman,  1956,  1957). 

3.  Further,  the  oommunallty  la  the  upper  limit  of  this 
squared  multiple  correlation  as  the  number  of  variable*  approaches 
infinity,  aeauming  constant  rank  (Outtman,  1957). 


4.  Since  the  communallty  Is  a  variance,  its  upper  limit  le 
unity,  i.e.. 


h^  ^  1.00 

5.  Unique  communalltles  can  be  obtained  only  when  the  rank 
of  the  matrix  satisfies  the  following  inequality  (Ledermann,  1937, 
Thurstone,  1947). 


^  2n  +  l-  »/8n  +  l 
_  2 


Hie  above  considerations  provide  a  point  of  departure  for 
both  theoretical  and  empirical  investigations. 

The  theoretical  investigation  of  factors  affecting  the 
determlnabllity  and/or  uniqueness  of  communalltles  was  the  subject 
of  a  Ph.D.  dissertation  by  Mrs.  Patricia  Zwllllnger.  A  copy  of 
this  dissertation  will  be  submitted  to  AFOSR  concurrent  with 
presentation  in  fulfillment  of  aoademlc  requirements.  Some  re¬ 
sults  included  here  were  taken  from  a  draft  of  this  dissertation. 

A  review  of  the  literature  indicated  that  the  properties 
required  of  an  acceptable  set  of  communalltles  may  be  grouped  in 
the  following  manner: 

1.  The  obtained  communalltles  must  be  within  the  following 
boundaries : 


0  *  Rj  ^hj  ^  1.00 


In  empirical  matrices  the  presence  of  error  leads  to  the 
omission  of  equality  signs,  giving  the  equation 

0  -s. Rj  -*rhj-*rl.00 


2.  The  factor  matrix  should  produce  the  correlation  matrix 
exactly: 


R  -  U*  -  PP' 

This  condition  Implies  Invariance  of  the  communalltles  re¬ 
gardless  of  the  order  In  which  factors  are  extracted.  Por  empirical 
matrices  the  equation  beoomes 

R  -  U*^  PP' 

This  relationship,  that  of  minimizing  residuals,  appears  to 
be  a  common  goal  of  factor  analysts. 
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3.  Minimum  rank  should  be  attained. 

4.  The  oorrelatlon  matrix  with  communalltles  In  the  diagonal 
muat  be  Oramlan. 

An  extensive  analysis  of  a  single  empirical  matrix  was  under¬ 
taken  In  order  to  oompare  the  oommunalltles  produced  by  new  and/or 
tested  methods  of  approximation  with  those  produced  by  more  popular 
factor  analytic  techniques.  The  matrix  chosen  for  extensive 
analysis  was  that  produced  by  the  MacQuarrle  Teat  for  Mechanical 
Ability.  It  was  chosen  on  the  basis  of  Its  previous  presence  In 
the  literature.  Its  simple  definition  and  Interpretation  of  Its 
variables,  and  lta  convenient  size  (7  x  7). 

The  test  consists  of  seven  subtests:  Tracing,  Tapping, 
Dotting,  Copying,  Location,  Blocks  and  Pursuit.  The  matrix  of 
correlations  among  the  seven  MacQuarrle  subtests  Is  given  In 
Table  1.  These  Intercorrelations  were  first  obtained  by  Qoodman 
(1947)  on  a  sample  of  329  radio  operators. 


table  1 


Subtest 


1 

2 

l 


Tracing 
Tapping 
Dotting 
Copying 
Location  .34 
Blocks  .41 
Pursuit  . 43 


.48 

:8 


2 

3 

4 

5 

6 

48 

.55 

.44 

.34 

.41 

47 

.47 

.31 

.29 

.29 

.34 

.43 

.32 

31 

.34 

.54 

.52 

29 

.43 

.54 

.54 

29 

.32 

.52 

.54 

.46 

29 

.36 

.48 

.44 

This  oorrelatlon  matrix  was  used  for  the  purpose  of  Illus¬ 
trating  various  methods  of  obtaining  communalltles.  The  resulting 
sets  of  communalltles  were  then  oompared  and  examined  to  see  how 
adequately  they  met  the  aforementioned  criteria  for  acceptability. 

First,  three  "exact"  solutions  of  the  MacQuarrle  matrix  were 
described,  two  of  rank  six  and  one  of  rank  five.  A  rank  four 
solution  was  then  found.  A  number  of  rank  three  solutions  already 
available  In  the  literature  were  examined  and  other,  new  rank  three 
solutions  oaloulated.  One  rank  two  and  one  rank  one  solution 
completed  the  survey.  In  eaoh  case,  the  factor  matrix  and  oom¬ 
munalltles  were  tabulated.  The  correlation  matrix  was  then 
produced  from  the  factor  matrix  and  residuals  were  oaloulated. 

A  summary  of  these  results  Is  found  In  Table  2.  Saoh  set  of  oom¬ 
munalltles  was  put  In  the  diagonal  of  the  original  oorrelatlon 
matrix  and  the  Inverse  was  taken. 
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TABLE  2 


Subtest 

1 

2 

3 

4 

5 

6 

7 

1  Tracing 

.48 

.48 

.55 

.44 

.34 

.41 

.43 

2  Tapping 

.47 

.47 

.31 

.29 

.29 

.29 

j  Dotting 

.34 

.43 

.32 

•k 

4  Copying 

.44 

Ol 

.34 

.54 

.54 

•5? 

.48 

.44 

5  Location 

•34 

.29 

.43 

.54 

6  Blocks 

.41 

.43 

.29 

.32 

.52 

.54 

•46 

7  Pursuit 

.29 

.36 

.48 

.44 

.46 

Multiple  Correlation  (R) 

.670 

.548 

.642 

.656 

.664 

.640 

.589 

Squared  multiple  R  (Ra) 

.449 

.300 

.412 

.431 

.441 

.411 

.347 

Rank  6  Communalltles 

DuBols -Sullivan  I 

.669 

.497 

.641 

.667 

.663 

.637 

DuBols -Sullivan  IIA 

.674 

.497 

.641 

.653 

.663 

.637 

Rank  5  Communalltles  . 

DuBols -Sullivan  I IB 

.674 

.497 

.641 

.536 

.820 

.506 

.488 

Rank  4  Communalltles  ... 

Modified  Tryon-Kaiser  A 

.529 

.861 

.884 

.540 

1.000 

.503 

.437 

Rank  3  Communalltles  .  . 

Modified  Tryon-Kaiser  B 

•511 

.298 

.884 

.540 

1.000 

.502 

•M 

Centroid  (Ooodman) 

.578 

.469 

•527 

.548 

.551 

.534 

Principal  Axes  (Chapman) 

.607 

.420 

.585 

.550 

.607 

.526 

.431 

Canonical 

.596 

.414 

.586 

.547 

.600 

.530 

.434 

Albert 

.§79 

.345 

.521 

.539 

.97§ 

.501 

.434 

Fortuitous 

.831 

.319 

.991 

.505 

.586 

.407 

4th  Order  Minor  Expans  lcn 

1.074 

.317 

1.814 

.555 

2.187 

-.897 

Rank  2  Communalltles 

Spearman -DuBols 

.583 

.407 

.543 

.538 

.570 

.503 

.428 

Rank  1  Communalltles 

Spearman  Single  Faotor 

.479 

.283 

.403 

.469 

.447 

.430 

.400 
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The  following  Important  results  were  noted: 

1.  Some  of  the  communallty  estimates  were  extremely  unstable 
(e.g.,  ,40 3  £  hf  .991) .  In  this  context.  It  was  pointed  out 
that  when  the  rank  of  the  matrix  Is  greater  than  half  Its  order, 
there  always  exist  an  Infinite  number  of  solutions.  However,  even 
when  the  orlterlon  of  r  ^  n/2  was  met,  extreme  variability  was 
found  (e.g.,  .551  ^  ha  ^  1.000) .  This  variability  could  not  be 
explained  satisfactorily  on  the  basis  of  error;  rather  It  pointed 
out  theoretical  contradictions  Inherent  In  the  various  methods. 

2.  The  relationship  between  rank  and  size  of  residuals 
proved  to  be  an  Inverse  one.  That  Is,  those  solutions  best  meeting 
the  minimization  of  residuals  criterion  tended  to  fall  short  on 
minimization  of  rank,  and  vice  versa.  This  emphasizes  the  need 
for  other  criteria,  such  as  Wrlgley 's  (1961)  salience. 

3.  The  practical  difficulties  Involved  In  obtaining  a 
unique  and  determinable  set  of  communalltles  were  emphasized. 

A  theoretical  analysis  of  the  following  question  was  then 
undertaken:  Under  what  conditions  may  a  correlation  matrix  be 
said  to  have  one  and  only  one  set  of  oommunalltles?  The  relation¬ 
ship  of  two  allied  concepts,  rank  and  complexity,  to  this  question 
was  also  Investigated. 

Ledermann  0-937)  and  Thurstone  (1935)  were  the  first  writers 
to  derive  expressions  defining  those  rank  conditions  under  whloh 
communalltles  could  be  unique.  Both  authors  arrive  at  the  same 
formula  although  their  lines  of  reasoning  are  somewhat  different. 
This  formula  Is  also  the  same  as  that  derived  by  Harman  0.960). 

All  three  proofs  Involve  the  same  assumptions  and  arrive  at  the 
same  conclusions.  An  adaptation  of  Thurstone's  presentation  is 
given  below. 

An  unique  configuration  of  test  vectors  determines  an  unique 
set  of  communalltles.  In  order  to  ascertain  what  rank  conditions 
will  permit  an  unique  test  configuration.  It  Is  necessary  to  know 
how  many  Independent  factors,  m,  can  be  precisely  determined  from 
any  given  correlation  matrix  of  order  n.  To  do  this,  the  number 
of  experimentally  Independent  correlation  coefficients  «ong 
variables  are  compared  to  the  nunber  of  linearly  Independent  faotor 
loadings  In  the  factor  matrix.  The  number  of  correlations  In  a 
correlation  matrix  Is 

2 

A  configuration  of  test  vectors  can  always  be  rotated  ao  that  the 
first  axis  is  collnear  with  the  first  teat  veotor,  the  second 
axis  orthogonal  to  the  first  veotor  In  the  space  spanned  by  the 
first  two  veotors,  and  so  forth.  This  operation  results  In  a  number 
of  zero  factor  loadings  In  the  faotor  matrix.  The  number  of  suoh 
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zero  loadings  Is 


m(m  -  1) 

2  * 

and  ao  the  number  of  linearly  Independent  factor  loadings  in  a 
given  factor  matrix  is 

nm  - 

A  model  correlation  matrix  and  factor  matrix  are  given  In  Table  3« 

TABLgJj 

Model  Correlation  and  Factor  Matrloes 
n  x  n  Correlation  Matrix  R 


1 

r12  r13 

rl4 

r15 

rl6 

r17 

2 

r23 

r24 

r25 

r26 

r27 

3 

r34 

r35 

r36 

r37 

4 

r45 

r46 

r47 

5 

r56 

r57 

6 

r67 

7 

n  x  m 

Paotor 

Matrix  F 

I 

II 

III 

1 

all 

0 

0 

2 

*21 

*211 

0 

3 

*31 

*311 

*3111 

4 

*41 

*411 

*4111 

5 

*51 

*511 

*5111 

6 

*6l 

*6ll 

*6111 

7 

*71 

*7H 

*7111 
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In  order  that  there  may  be  an  unique  configuration  of 
test  vectors  for  the  correlation  matrix*  it  is  necessary  for 
the  number  of  experimentally  independent  constants  (correlation 
coefficients)  in  R  to  eaual  or  exceed  the  number  of  linearly 
Independent  parameters  (factor  loadings)  in  P.  Hence, 

_  m(m  -  II  ^  n(n  -  lj. 
nm  “g  —  2 

from  which  it  can  be  seen  that  £-ma  +  (2n  +  l)m-n8+nj  ^  0. 
Solving  for  m, 

+  8p  1 1  +-r*m)(m  .  an ± i  0 

Assuming  0  m.  m  n,  the  first  expression  above  muBt  be  positive, 
therefore , 


m  u  (1) 

In  other  words,  it  is  necessary  that  the  above  rank  formula  be 
satisfied  in  order  for  a  correlation  matrix  of  n  tests  to  have  an 
unique  configuration  in  m  dimensions.  In  a  similar  manner,  it 
also  follows  from  the  original  equation  that  the  minimum  number 
of  tests  required  for  the  unambiguous  determination  of  m  factors 
is 


n  *  [&  +  1)  (2) 

A  careful  examination  of  Thurstone's  argument,  given  above, 
reveals  certain  problem  areas.  First  of  all,  it  is  clear  that 
the  above  relations  are  necessary  for  an  unique  configuration  but 
not  sufficient.  They  are  required  for  but  do  not  guarantee  an 
unique  configuration.  Further  specifications  would  be  necessary 
before  sufficient  conditions  could  be  defined.  In  addition,  the 
stipulation  is  made  that  the  correlations  be  experimentally  inde¬ 
pendent  .  It  will  be  shown  in  this  section  that  the  linear  lnde- 
pendence  of  the  correlations  is  also  an  Important  faotor  related 
to  the  uniqueness  of  communalltles.  Most  important  of  all,  the 
restriction  is  made  in  this  argument  that  the  faotor  loadings 
must  be  linearly  Independent.  However,  the  equations  relating 
the  factor  matrix  to  the  correlation  matrix  are  of  the  following 
form: 


aJIakI  +  aJIIakII  +  aJIIIakIII  "  rJk 

These  are  not  linear  equations.  Only  Outtman  (1958)  seams  to  have 
reoognlzed  this  important  fact,  and  even  he  did  not  investigate 


I 


its  consequence.  To  be  precise*  these  ere  quadratic  equations; 
that  is*  they  are  second  degree  equations*  not  first  degree 
equations*  It  would  therefore  appear  more  appropriate  if  the 
restriction  be  made  that  these  equations  have  second  degree 
independence*  rather  than  linear  independence. 


Both  Ledermann  and  Harman  base  their  proofs  on  minor 
expansion  solutions  for  communal it lea.  Ledermann  proves  that 
minors  of  order  (m  +  1)  are  linearly  independent*  while  Harman 
assumes  that  principal  minors  of  order  (m  +  1)  and  (m  +  2)  are 
linearly  Independent.  The  principal  minor  expansions*  however* 
do  not  yield  linear  equations*  since  the  principal  diagonal 
must  always  contain  the  unknown  eommunalltles.  The  first  term 
in  the  principal  minor  expansion*  then*  is  the  product  of  (m  +  1) 
or  (m  +  2)  eommunalltles,  resulting  in  a  (m  +  l)th  or  (m  +  2)th 
degree  equation.  For  example*  the  expansion  of  the  following 
3x3  principal  minor, 

hf  r12  r15 

r12  h2  r23 

r13  r23  h3 


yields  the  following  equation: 


h?h*hf  +  2r. _r.  r _  -  r?_h? 


12  3 


12  13  23 


23“1 


ri3h2  ’  rl2h3  "  ° 


This  is  a  third  degree  equation  in  three  unknowns*  not  a  linear 
equation.  Third  degree  independence  is  therefore  required* 
rather  than  linear  Independence. 

The  type  of  independence  required  for  the  above  proofs  is 
not  the  only  problem  encountered  in  these  attempts  to  define  the 
relation  between  the  rank  of  the  correlation  matrix  and  the 
uniqueness  of  the  coramunallty.  Another  question  that  can  be 
raised  is  whether  the  assumption  can  be  made  that  the  rank  of 
the  whole  correlation  matrix  is  also  the  rank  of  various  sub- 
matrices  as  well.  An  assumption  of  this  kind  will  be  shown  to 
be  necessary  if  the  above  solutions  for  the  communallty  are  to 
be  determinate.  However*  a  basic  assumption  of  factor  analysis 
is  that  the  direct  solution,  resembling  the  factor  matrix  (in 
Table  3)  used  in  Thurstone's  proof  above*  can  be  rotated  to  a 
simpler  structure.  The  underlying  proposition  involved  here  is 
that  the  complexity  of  most*  if  not  all*  variables  is  less  than 
the  rank  of  the  correlation  matrix  (Thurstone,  19*7).  An 
implication  of  this  proposition  is  that  some  of  the  factor 
loadings  in  the  unrotated  factor  matrix  are  linearly  dependent. 
Another  implication  is  that  any  attempts  to  define  the  conditions 
under  whloh  eommunalltles  are  unique  must  take  into  consideration 
not  only  the  rank  of  the  correlation  matrix  as  a  whole*  but  also 
the  complexity  of  the  variables  and  the  factors  involved  as  well. 
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When  such  complexity  wee  investigated  by  Mrs.  Zwillinger, 
certain  inconsistencies  in  the  development  of  commonly  used  oriteria 
were  noted.  As  mentioned  earlier,  the  assumption  cannot  be  made 
that  all  the  variables,  or  more  precisely,  (n  -  m)  variables,  have 
complexity  m,  if  the  ooncept  of  simple  structure  is  to  be  retained. 
This  assumption  appears  to  be  made  in  the  proofs  described  pre¬ 
viously.  Zt  is  made  explicit  in  Thurstone's  line  of  reasoning, 
but  is  not  as  obvious  a  part  of  the  arguments  of  Ledermann  and 
Harman.  It  can  be  shown,  however,  that  the  solutions  these  proofs 
are  based  on  require  that  certain  minors  of  order  m  be  non-zero 
for  all  the  oommunallties  to  be  unique.  This  conclusion  is  based 
on  the  fact  that  matrloes  can  be  constructed  which  fulfill  the 
necessary  rank  requirement  Formula  1,  but  which  still  have  variables 
with  indeterminate  oommunallties.  In  these  cases,  some  but  not  all 
of  the  oommunallties  were  indeterminate,  indicating  that  the  de¬ 
fined  rank  requirements  of  Formula  1  actually  guarantee  only  that 
unique  oommunallties  can  be  found  for  some  of  the  variables  in  the 
matrix.  For  all  the  ooomunalltles  to  be  unique,  the  factor  com¬ 
plexity  Formula  2  also  has  to  be  satisfied  by  sub-groups  of  factors. 
It  oan  also  be  shown  that  unique  oommunallties  are  not  guaranteed 
even  when  the  faotor  matrix  satisfies  both  the  rank  requirements 
of  Formula  1  as  a  whole  as  well  as  the  faotor  complexity  Formula  2 
for  all  possible  subgroups  of  factors. 

In  order  that  all  the  oommunallties  be  unique  and  determinate, 
there  must  be  at  least  n(m  x  m)  correlation  submatrices  containing 
linearly  Independent  coefficients,  one  such  matrix  per  variable, 
each  located  so  that  It  contains  none  of  the  original  diagonal 
entries  (since  these  are  unknown)  and  no  correlations  Involving  the 
associated  variable.  This  stipulation  applies  only  to  matrices  for 
which  r<n/2.  The  minor  expansion  solution  for  the  oommunallties 
must  produce  non-linear  equations  when 

n/2  £m  £  2n  .t  l 

2 

In  this  situation,  as  n  and  m  increase,  the  difficulties  in¬ 
volved  in  finding  unique  solutions  for  the  sets  of  simultaneous 
equations  Involved  multiply  rapidly. 


A  definition  of  the  necessary  and  sufficient  conditions  for  a 
set  of  unique  oommunallties  still  was  not  attained.  Such  conditions. 
If  they  could  be  specified,  would  in  essence  allow  a  variable  to 
have  as  few  non-sero  factor  loadings  as  possible  without  permitting 
any  factor  or  group  of  factors  to  be  undetermined.  It  was  shown 
that  the  well-known  rank  requirement  formula: 


a<a-t.],  |^SO lI 


(i) 


is  a  necessary  but  not  a  sufficient  condition  for  unique  oom¬ 
munallties.  This  restriction  does  not  allow  for  the  varying 
complexity  of  variables  and  of  factors  which  is  possible  if 
simple  structure  Is  to  be  assumed.  In  order  to  allow  variables 
and  faotor  loadings  to  have  as  few  non-sero  loadings  as  possible 
and  still  maintain  unique  oommunallties,  further  restrictions  are 
required.  An  additional  necessary  condition  was  proposed,  namely 
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that  the  number  of  variables ,  nf,  with  non-zero  loadings  on  a 
group  of  m‘  factors,  for  all  m'  4  m,  be 


.  .  2m»  +  1  +  */&n'  +  1 
n»  ^  * - - - , 


m» 


1,2,3, 


•  •  • ,  m* 


Each  of  these  n*  variables  must  have  a  non-zero  loading  on  at 
least  one  (but  not  neoessarlljr  more  than  one)  of  the  m*  faotors 
In  the  group.  This  condition  has  been  Informally  recognized  In 
the  literature  for  m1  -  1,  but  it  does  not  seem  to  have  been 
generalized  beyond  single  faotors. 

Even  these  two  necessary  conditions  taken  together  were 
shown  not  to  be  sufficient;  multiple  solutions  are  still  possible 
for  certain  cases  even  when  both  conditions  are  met. 

At  this  point,  sufficient  conditions  can  be  proposed. 

It  Is  contended  that  every  associated  correlation  and  factor 
matrix  meeting  the  following  three  restrictions  will  have  unique 
communallties : 

1.  Por  (n  x  n)  R  and  (n  x  m)  F,  m  <r  & 

2.  For  F,  n* >  an’,  m1  -  1,2,  ...,  m,  for  all  possible 
groups  of  m'  factors  In  F. 

3.  For  R,  there  must  be  at  least  n(m  +  l)(m  +  1)  non¬ 
zero  minors,  1  per  variable,  containing  no  elements  from  the  given 
veotor  and  no  diagonal  elements  from  R. 

These  conditions  are  not  necessary,  however,  since  unique  solutions 
can  be  constructed  which  violate  them. 

The  difficulties  encountered  when 

a.  as  n  *  a..±.  1  1 1 

2  2 

seem  to  permit  no  simple  solution.  All  theoretical  solutions 
for  the  oommunallty  become  non-linear  and  may  have  multiple 
solutions,  unique  and  multiple  solutions  for  matrices  with  the 
same  order  and  rank  which  have  this  rank  oondltlon  have  been  con¬ 
structed.  It  appears  that  the  non-linear  equations  defining  the 
faotor  loadings  In  these  oases  are  rather  complicated,  and  the 
number  of  these  equations  and  the  number  of  non-zero  minors  neces¬ 
sary  for  unique  oommunalltles  are  a  oomplex  function  of  the 
amount  by  which  the  rank  of  the  matrix  exoeeds  , 

It  would  be  well  to  point  out  that  the  faot  that  any  of  the 
statements  regarding  rank  that  have  been  made  In  this  theoretical 
will  be  applicable  only  to  the  faotor  matrices  derived 
rrom  real  data,  and  will  never  be  directly  and  precisely  appli¬ 
cable  to  empirical  matrices. 
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Concurrently  with  the  theoretical  investigation  oarried 
on  by  Mrs .  Zwillinger,  Mr*  Robert  Sullivan  was  refining  a  pre¬ 
viously  developed  method  of  estimating  communalities.  This  work 
ia  the  subject  of  a  Ph.D*  dissertation,  a  copy  of  which  will  be 
submitted  to  AFOSR  concurrent  with  presentation  in  fulfillment  of 
aoademlo  requirements. 

Historically  the  problem  of  determining  suitable  approxi¬ 
mations  of  oommunallties  arose  with  the  advent  of  multiple  factor 
analysis* 

Harman  (i960)  approaches  the  general  problem  of  the  00m- 
munallty  through  the  following  considerations: 

1.  The  conditions  that  the  correlation  coefficients  must 
satisfy  in  order  for  their  matrix  to  have  a  given  rank; 

2*  The  determination  of  oommunallty  under  the  assumption 
of  the  rank  of  the  correlation  matrix; 

3*  The  theoretical  solution  for  communallty;  and 

4.  The  approximations  to  oommunallty  without  prior  knowledge 
of  the  rank* 

He  deals  with  these  cots lderat ions  in  sequence  by  laying  down 
the  algebralo  conditions  for  given  ranks;  theoretical  and  algebraic 
solutions  of  oommunallties  assuming  rank  for  artificial  matrices, 
such  as  Albert's  method;  and  of  prime  importance  here,  the  iterative 
procedures  of  Kaiser,  Outtman,  Wrlgley  and  others  of  approximating 
communalltlea  without  prior  knowledge  of  the  rank. 

To  demonstrate  that  there  la  no  paucity  of  methods  for  esti¬ 
mating  oommunallties  a  review  pf  literature  concerning  oommunallties 
reveals  the  methods  of  estimating  oommunallties  listed  in  Table  A. 

The  intent  here  is  not  to  add  to  this  already  long  and  varied 
list  another  method  of  estimating  oommunallties,  but  to  present 
a  method  of  estimating  oommunallties  that  haa  many  unique  oharao- 
teristlos. 

Aa  mentioned  previously,  the  prlnolpal  concern  of  conventional 
factor  analysis  is  scientific  parsimony*  As  DuBola  states, 

"Host  faotor  analysts  are  probably  more  interested  in  factoring 
a  good  approximation  of  the  observed  correlation  matrix  than  they 
are  in  factoring  the  correlation  matrix  itself* "  One  of  the  main 
concerns  of  this  method  is  the  observed  correlation  matrix  and 
not  the  conventional  faotor  generated  matrix*  Thus,  the  emphasis 
is  on  preolslon  rather  than  rank  reducing  characteristic a. 

An  important  consideration,  perhaps  the  most  important,  is 
that  the  estimated  oommunallties  obtained  by  the  method  to  be 
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TABLE  4 


1.  Trial  and  Error  Exact  Formula.  Assumption  Is  that 
sampling  error  and  rounding  errors  do  not  occur. 

2.  Exact  Formula.  Applies  only  to  special  oases.  For 
example ,  a  one  factor  solution. 

3.  Successive  Approximations. 

4.  Co-linear  Graphical  Methods. 

5*  Centroid  Number  1.  Method  which  begins  with  the  highest 
correlation. 

6.  Centroid  Number  2.  Method  by  which  a  test  Is  projected 
on  a  group  of  tests  having  the  highest  correlation. 

7*  Three  Test  Formula*  Uses  a  cluster  of  three  highly 
correlated  tests. 

8.  Four  Test  Formula.  Expansion  of  the  three  test  formula. 

9*  Summation  Formula.  Takes  the  variable  that  correlated 
the  highest  with  a  given  variable. 

10.  Spearman  Formulas. 

11.  Carlson  Method.  Bnployment  of  a  graphical  procedure. 

12.  Cayley  Hamilton  Theorem.  An  algebraic  solution  for 
coomunalltles  which  states  that  any  square  matrix  satisfies  Its 
own  characteristic  equation. 

13*  A.  Albert's  Method.  A  method  of  estimating  communalltles 
by  the  linear  dependence  of  rows.  It  Is  based  upon  the  assumption 
that  the  rank  Is  less  than  half  the  order. 

14.  Multiple  R  and  successive  adjustments. 

15.  Tryon -Kaiser  Solution  for  Coomunalltles. 

16.  Modification  of  the  Tryon-Kalser  Solution. 

17*  The  Averold  Method. 

18.  Harris'  Lower  Bound. 
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described  hare  are  proper  oommunalltles .  ( Communal ity  estimates 

are  considered  proper  If  they  preserve  the  Oramlan  properties  of 
the  correlation  matrix.)  It  can  be  demonstrated  that  other 
estimates  of  communalltles  do  not  preserve  the  Oramlan  properties 
of  the  observed  correlation  matrix. 

Similar  to  the  approaches  of  Outtman,  Tryon  and  Kaiser,  this 
method  employs  an  Iterative  procedure  of  estimation,  but  Is  unlike 
them  In  that  multiple  R  Is  used  as  the  starting  point  Instead  of 
the  Squared  Multiple  Correlation  (SMC).  Computationally  It 
differs  from  these  methods  In  that  an  analytical  rearrangement 
procedure  Is  used  Instead  of  the  mathematically  Inconvenient  method 
of  solving  for  R  or  R*  by  obtaining  the  inverse  of  the  correlation 
matrix. 

The  method  developed  utilized  the  following  procedure: 

1.  With  l*s  In  the  prlnolpal  diagonals,  the  zero  order 
correlation  matrix  Is  reduced  using  pivotal  condensation. 

2.  During  the  reduotion  phases  the  matrix  Is  also  re** 
arranged  et  prescribed  times  In  order  that  all  of  the  multiple 
R's  for  all  variables  may  be  computed.  (See  Figure  1.) 

3.  During  the  Iterative  phasee  If  any  varlanoe  becomes 
negative,  the  amount  by  which  it  becomes  negative  Is  added  to 
the  varlanoe  In  question  and  the  complete  reduotion  prooess  Is 
repeated. 

4.  After  eaoh  complete  reduotion  of  all  variables,  the 
adjustment  formula  as  listed  below  is  employed  for  eaoh  variable. 

»i  -  ^vivj  -  vu  * 

where  V*  «  Varlanoe  for  a  particular  variable  previous 
1  to  reduotion. 

Vt1  -  The  unpredloted  variance  for  a  variable 
J  after  reduction. 

V"  -  The  adjusted  value  to  be  substituted  In  the 
1  matrix  for  reduotion. 

*It  Is  to  be  noted  that  the  first  reduotion  and  application 

of  the  adjustment  formula  would  give  multiple  R. 

5*  The  result  of  the  above  formula  la  then  substituted  Into 
the  diagonals  of  the  zero  order  correlation  matrix  and  the  above 
steps  are  repeated  until  there  Js  no  o hangs  in  the  values  of  the 
diagonal  elesMnta.  By  no  change  it  la  meant  that  convergence  la  ob¬ 
tained  either  Ideally  or  within  a  certain  speolfled  tolerance, 
e.g.,  .0003* 
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Pertinent  results  obtained  by  use  of  the  formula  Include 
the  following: 

1.  It  was  discovered  that  variation  of  the  order  of  reduction 
of  the  matrix  may  result  In  either  A)  no  convergence,  and  thus  no 
solution,  or  B)  alternate  sets  of  exact  communalities— that  Is, 
communalltles  which  reproduce  the  original  correlation  matrix, 
leaving  zero  residuals.  The  alternate  sets  of  communalltles  thus 
produced  are  not  necessarily  different  for  all  variables. 

Invariance  of  h5's  despite  varying  order  of  reduction  Is  obtained 
on  the  maJorltyJof  variables.  Usually,  only  one  or  two  of  the 
hj's  change. 

2.  One  of  the  prime  characteristics  of  the  method  of  esti¬ 
mation  described  herein  Is  that  the  matrix  remains  Qramlan  at  all 
times.  In  order  to  demonstrate  that  other  methods  of  estimation 
do  not.  In  general,  preserve  the  Qramlan  properties  of  the  matrix 
throughout  the  entire  estimation  procedure,  the  MacQuarrle  matrix 
was  selected  for  further  Investigation.  Communalltles  found  by 
the  following  methods  were  examined. 

A.  Centroid 

B.  Principal  Axes 

C.  Spearman 

D.  Albert 

B.  Modified  Tryon -Kaiser 

F.  Single  Factor 

Q.  Canonical 

H.  Spearman -DuBo la 

The  obtained  hf's  were  Inserted  as  diagonal  elements  In  the 
original  correlation  matrix  which  was  then  reduced.  In  all  oases 
the  matrix  became  non -Qramlan. 

3«  Analyses  of  a  variety  of  matrices  were  undertaken  In  an 
attempt  to  demonstrate  the  general  applicability  of  the  Iterated 
and  adjusted  method.  The  following  matrices  were  analyzed. 

A. 

B. 

C. 

D. 

B. 

In  all  oases,  the  Iterated  and  adjusted  method  produced  con¬ 
vergence  to  exact  solutions.  In  case  A,  alternate  sets  of  exact 
comnunalltles  were  produoed.  In  case  C,  convergence  could  not  be 
obtained  with  the  variables  In  their  natural  order.  Rearrangement 
of  the  variables,  however  permitted  an  exact  solution  of  order 
N  -  1,  Case  D  was  analysed  In  two  distinct  groups,  producing 
Identical  solutions. 


Guilford  (19^7)  Aptitude  Factor 
Analytic  Study. 

Spearman-Holzlnger  Unitary  Trait  Study 
(1934,  1935 >  1936).  (First  eight  variables.) 
Artificial  Matrix.  (Thurstone,  1933.) 
Mullen's  Matrix  (1939). 

Qulon  Matrix  (1953). 
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Attempts  to  use  the  method  on  matrices  of  higher  order 
(e.g.,  24  x  24)  were  unsuccessful,  apparently  because  the  singular 
characteristics  of  these  matrices  prevent  maintenance  of  Qramlan 
properties  throughout  the  procedure. 


Conclusions 

One  purpose  of  the  present  study  has  been  to  define  those 
conditions  under  which  a  correlation  matrix  may  be  said  to  have  one 
and  only  one  set  of  communalltles.  Practical  conditions  discussed 
In  the  literature  applicable  to  empirical  data  were  often  found  to 
be  contradictory  (e.g.,  minimized  rank  versus  minimized  residuals), 
arbitrary  (e.g.,  different  specific  orlterla  for  minimized  re¬ 
siduals),  and,  taken  together,  permitted  a  number  of  alternate  sets 
of  communalltles,  none  of  which  completely  satisfied  all  these 
criteria  for  acceptability*  In  addition,  certain  factorial  methods 
were  found  to  produce  results  which  were  not  entirely  as  residual- 
minimizing  as  they  were  reputed  to  be.  The  Ideal  conditions 
applicable  to  artificial  data  were  also  found  to  be  incomplete  and 
to  allow  multiple  solutions. 

It  appears  that  the  conditions  that  are  both  necessary  and 
sufficient  for  unique  communalltles  still  cannot  be  defined  at  the 
present  time.  This  failure  Is  In  part  due  to  the  extreme  complexity 
of  the  equations  Involved  when  the  rank  of  the  matrix  Is  within  the 
following  range: 


B  «  8n  t  *  -•yftl 
2  2 

This  may  not  be  a  serious  fault,  practically  speaking,  since  the 
matrices  found  In  empirical  Investigations  generally  are  assumed  to 
have  a  rank  below  these  limits. 

Several  conclusions  can  be  drawn  as  to  the  utility  of  the 
Thurstons  factor  model  In  view  of  the  above  results.  Plrst  of  all, 
the  model  Is  not  entirely  satisfactory  from  a  theoretical  point  of 
view,  slnoe  unique  and  determinate  solutions  cannot  always  be 
speolfled.  The  purpose  of  the  model  Is  to  provide  a  rationale  for 
the  translation  of  a  multitude  of  observed  data  to  a  modicum  of 
basic  dimensions,  and  this  transformation  must  be  logically  preolse 
and  unambiguous;  otherwise  verification  of  the  model  Is  Impossible. 
A  certain  amount  of  Imprecision  must  be  tolerated  when  dealing  with 
emplrloal  data,  but  the  variety  of  distinctly  different  solutions 
whioh  were  found  In  the  present  study  to  be  compatible  with  this 
model  appears  to  be  so  great  as  to  Impair  its  utility  seriously. 

A  seoond  purpose  has  been  to  develop  and  refine  a  feasible 
method  of  estimating  proper  oommunalltles.  A  method  utilising 
Iteration  and  adjustment  was  developed  which: 

1.  Was  demonstrated  to  have  a  high  degree  of  general  appli¬ 
cability  In  producing  exact  solutions  for  communalltles  of  order 
N  -  1  and  In  some  cases  of  order  N  -  2.  These  solutions  are  for 
empirical  matrices. 
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2.  When  successful ,  accomplishes  what  no  other  method  of 
estimating  communalltles  does  consistently;  namely,  preserves 
the  Qramlan  properties  of  the  original  correlation  matrix. 

3*  Meets  the  necessary  conditions  for  determination  of 
communalltles. 

4.  Offers  evidence  for  the  existence  of  alternate  exact 
sets  of  communalltles. 

Particular  emphasis  has  been  placed  upon  the  desirability  of 
analytic  methods  which  preserve  the  Qramlan  properties  of  the  matrix. 
It  should  be  reiterated  In  support  of  this  emphasis  that  the  com- 
munallty  itself  has  only  a  theoretical  existence;  therefore,  any 
method  of  estimating  such  a  hypothetical  construct  must  of  necessity 
remain  meaningful  throughout.  When  a  matrix  becomes  non -Qramlan,  the 
concept  of  negative  variance  Is  Introduced  and  any  results  must  be 
suspect.  This  problem  does  not  exist  when  one  Is  dealing  with 
other  than  empirical  matrices. 

Documents  produced  in  the  course  of  the  contract  work  are 
listed  In  Appendix  A. 
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